Coordinated Exploration in Concurrent Reinforcement Learning

Authors

  • Maria Dimakopoulou
  • Benjamin Van Roy

Abstract

We consider a team of reinforcement learning agents that concurrently learn to operate in a common environment. We identify three properties – adaptivity, commitment, and diversity – which are necessary for efficient coordinated exploration and demonstrate that straightforward extensions to single-agent optimistic and posterior sampling approaches fail to satisfy them. As an alternative, we propose seed sampling, which extends posterior sampling in a manner that meets these requirements. Simulation results investigate how per-agent regret decreases as the number of agents grows, establishing substantial advantages of seed sampling over alternative exploration
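To make the three properties concrete, here is a minimal sketch of the seed idea in a much simpler setting than the one the abstract describes: a Gaussian multi-armed bandit with a shared data buffer, not a full reinforcement learning environment. Everything in it is an assumption made for illustration (the class `SeedSamplingAgent`, the helpers `gaussian_posterior` and `run`, the prior, the noise model, and the synchronous rounds); it is not the authors' algorithm or code. Each agent draws a random seed once and keeps it (commitment), seeds differ across agents (diversity), and the seed is mapped through the current shared posterior, so each agent's sample shifts as pooled data arrives (adaptivity).

```python
import math
import random


class SeedSamplingAgent:
    """Hypothetical toy agent: its posterior sample is a fixed function of its seed."""

    def __init__(self, num_arms, rng):
        # Drawn once and never resampled (commitment); independent draws
        # across agents give different samples (diversity).
        self.seed = [rng.gauss(0.0, 1.0) for _ in range(num_arms)]

    def sampled_means(self, post_mean, post_std):
        # Map the fixed seed through the current shared posterior (adaptivity).
        return [m + s * z for m, s, z in zip(post_mean, post_std, self.seed)]

    def act(self, post_mean, post_std):
        # Act greedily with respect to this agent's own sample.
        values = self.sampled_means(post_mean, post_std)
        return max(range(len(values)), key=values.__getitem__)


def gaussian_posterior(prior_mean, prior_var, noise_var, rewards):
    """Conjugate update for a Gaussian mean with known noise variance."""
    precision = 1.0 / prior_var + len(rewards) / noise_var
    mean = (prior_mean / prior_var + sum(rewards) / noise_var) / precision
    return mean, math.sqrt(1.0 / precision)


def run(num_agents=5, true_means=(0.2, 0.5, 0.8), rounds=20,
        noise_var=1.0, rng_seed=0):
    """Simulate agents that pool observations (assumed N(0, 1) prior per arm)."""
    rng = random.Random(rng_seed)
    agents = [SeedSamplingAgent(len(true_means), rng) for _ in range(num_agents)]
    shared = [[] for _ in true_means]  # rewards seen by any agent, per arm
    for _ in range(rounds):
        posts = [gaussian_posterior(0.0, 1.0, noise_var, r) for r in shared]
        means = [p[0] for p in posts]
        stds = [p[1] for p in posts]
        choices = [agent.act(means, stds) for agent in agents]
        for arm in choices:
            noise = rng.gauss(0.0, math.sqrt(noise_var))
            shared[arm].append(true_means[arm] + noise)
    return agents, shared
```

Calling `run()` returns the agents and the pooled per-arm rewards. Resampling the seed every round would turn this sketch back into independent per-step posterior sampling, which gives up commitment.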

Similar resources

A reinforcement learning approach to coordinate exploration with limited communication in continuous action games

Learning automata are reinforcement learners belonging to the class of policy iterators. They have already been shown to exhibit nice convergence properties in a wide range of discrete action game settings. Recently, a new formulation for a Continuous Action Reinforcement Learning Automata (CARLA) was proposed. In this paper we study the behavior of these CARLA in continuous action games and pr...

An RL Approach to Coordinate Exploration with Limited Communication in Continuous Action Games

Learning automata are reinforcement learners belonging to the category of policy iterators. They have already been shown to exhibit nice convergence properties in discrete action games. Recently, a new formulation for a Continuous Action Reinforcement Learning Automaton (CARLA) was proposed. In this paper we study the behavior of these CARLA in continuous action games and propose a novel method...

Efficient Exploration in Reinforcement Learning

Exploration plays a fundamental role in any active learning system. This study evaluates the role of exploration in active learning and describes several local techniques for exploration in finite, discrete domains, embedded in a reinforcement learning framework (delayed reinforcement). This paper distinguishes between two families of exploration schemes: undirected and directed exploration. Whil...

Efficient Exploration In Reinforcement Learning Sebastian

Exploration plays a fundamental role in any active learning system. This study evaluates the role of exploration in active learning and describes several local techniques for exploration in finite, discrete domains, embedded in a reinforcement learning framework (delayed reinforcement). This paper distinguishes between two families of exploration schemes: undirected and directed exploration. Whil...

Single-Agent vs. Multi-Agent Techniques for Concurrent Reinforcement Learning of Negotiation Dialogue Policies

We use single-agent and multi-agent Reinforcement Learning (RL) for learning dialogue policies in a resource allocation negotiation scenario. Two agents learn concurrently by interacting with each other without any need for simulated users (SUs) to train against or corpora to learn from. In particular, we compare the Q-learning, Policy Hill-Climbing (PHC) and Win or Learn Fast Policy Hill-Climbi...

Journal title:
  • CoRR

Volume: abs/1802.01282

Publication date: 2018